3 research outputs found
A Proof-Theoretic Approach to Scope Ambiguity in Compositional Vector Space Models
We investigate the extent to which compositional vector space models can be
used to account for scope ambiguity in quantified sentences (of the form "Every
man loves some woman"). Such sentences containing two quantifiers introduce two
readings, a direct scope reading and an inverse scope reading. This ambiguity
has been treated in a vector space model using bialgebras by (Hedges and
Sadrzadeh, 2016) and (Sadrzadeh, 2016), though without an explanation of the
mechanism by which the ambiguity arises. We combine a polarised focussed
sequent calculus for the non-associative Lambek calculus NL, as described in
(Moortgat and Moot, 2011), with the vector based approach to quantifier scope
ambiguity. In particular, we establish a procedure for obtaining a vector space
model for quantifier scope ambiguity in a derivational way.Comment: This is a preprint of a paper to appear in: Journal of Language
Modelling, 201
A Compositional Vector Space Model of Ellipsis and Anaphora.
PhD ThesisThis thesis discusses research in compositional distributional semantics: if words
are defined by their use in language and represented as high-dimensional vectors
reflecting their co-occurrence behaviour in textual corpora, how should words be
composed to produce a similar numerical representation for sentences, paragraphs
and documents? Neural methods learn a task-dependent composition by generalising
over large datasets, whereas type-driven approaches stipulate that composition
is given by a functional view on words, leaving open the question of what those
functions should do, concretely.
We take on the type-driven approach to compositional distributional semantics
and focus on the categorical framework of Coecke, Grefenstette, and Sadrzadeh
[CGS13], which models composition as an interpretation of syntactic structures as
linear maps on vector spaces using the language of category theory, as well as the
two-step approach of Muskens and Sadrzadeh [MS16], where syntactic structures
map to lambda logical forms that are instantiated by a concrete composition model.
We develop the theory behind these approaches to cover phenomena not dealt with
in previous work, evaluate the models in sentence-level tasks, and implement a tensor
learning method that generalises to arbitrary sentences.
This thesis reports three main contributions. The first, theoretical in nature, discusses
the ability of categorical and lambda-based models of compositional distributional
semantics to model ellipsis, anaphora, and parasitic gaps; phenomena that
challenge the linearity of previous compositional models. Secondly, we perform an
evaluation study on verb phrase ellipsis where we introduce three novel sentence
evaluation datasets and compare algebraic, neural, and tensor-based composition
models to show that models that resolve ellipsis achieve higher correlation with humans.
Finally, we generalise the skipgram model [Mik+13] to a tensor-based setting
and implement it for transitive verbs, showing that neural methods to learn tensor
representations for words can outperform previous tensor-based methods on compositional
tasks